Frequency Moments

نویسنده

  • David P. Woodruff
چکیده

DEFINITION Consider a stream (i.e., an ordered list) S = a1, a2, . . . , an of elements ai ∈ [m] def = {1, 2, . . . ,m}. For i ∈ [m], its frequency fi is the number of times it occurs in S. The k-th frequency moment Fk of S, for real k > 0, is defined to be Fk(S) = ∑ i∈[m] f k i . Interpreting 0 0 as 0, we also define F0 this way, so that it equals the number of distinct elements in S. Observe that F1 = n is the length of S. In the database community, F2 is known as the repeat rate or Gini’s index of homogeneity. It is also natural to define F∞ = max1≤i≤m fi. It is usually assumed that n is very large and that algorithms which compute the frequency moments do not have enough storage to keep the entire stream in memory. It is also common to assume that they are only given a constant (usually one) number of passes over the data. It is also assumed that the stream is presented in an arbitrary, possibly worst-case order. This necessitates the use of extremely efficient randomized approximation algorithms. An algorithm A ( , δ)-approximates the kth frequency moment Fk if for any input stream S, Pr[|A(S)−Fk(S)| ≤ Fk(S)] ≥ 1− δ, where the probability is over the coin tosses of A. Here, by A(S) we mean that A is presented items in S one-by-one. Efficiency is measured in terms of the amount of memory and update time of the algorithm.

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Low flow frequency analysis by L-moments method (Case study: Iranian Central Plateau River Basin)

Knowledge about low flow statistics is essential for effective water resource planning and management in ungauged orpoorly gauged catchment areas, especially in arid and semi-arid regions such as Iran. We employed a data set of 20 riverflow time-series from the Iranian Central Plateau River Basin, Iran to evaluate the low-flow series using several frequencyanalysis methods and compared the resu...

متن کامل

Comparison of Artificial Neural Network, Decision Tree and Bayesian Network Models in Regional Flood Frequency Analysis using L-moments and Maximum Likelihood Methods in Karkheh and Karun Watersheds

Proper flood discharge forecasting is significant for the design of hydraulic structures, reducing the risk of failure, and minimizing downstream environmental damage. The objective of this study was to investigate the application of machine learning methods in Regional Flood Frequency Analysis (RFFA). To achieve this goal, 18 physiographic, climatic, lithological, and land use parameters were ...

متن کامل

کاربرد تئوری گشتاورهای خطی در تحلیل تناوب سیل حوزه‌های آبخیز مرکزی ایران

Numerous methods are used in the investigation of floods in catchments such as regional flood frequency analysis. Regional flood frequency analysis relies on physical, climatic and ecological characteristics of catchments and applies statistical methods to study flow records. Hosking and Wallis developed Probability Weighted Moments and presented L-moments statistics as a new tool for flood fre...

متن کامل

کاربرد تئوری گشتاورهای خطی در تحلیل تناوب سیل حوزه‌های آبخیز مرکزی ایران

Numerous methods are used in the investigation of floods in catchments such as regional flood frequency analysis. Regional flood frequency analysis relies on physical, climatic and ecological characteristics of catchments and applies statistical methods to study flow records. Hosking and Wallis developed Probability Weighted Moments and presented L-moments statistics as a new tool for flood fre...

متن کامل

Measuring mass moments and electromagnetic moments of a massive, axisymmetric body, through gravitational waves

The electrovacuum around a rotating massive body with electric charge density is described by its multipole moments (mass moments, mass-current moments, electric moments, and magnetic moments). A small uncharged test particle orbiting around such a body moves on geodesics if gravitational radiation is ignored. The waves emitted by the small body carry information about the geometry of the centr...

متن کامل

Fast Frequency Sweep Technique for E cient

This paper describes a new approach to spectral response computation of an arbitrary 2D waveguide. This technique is based on the Tangential Vector Finite Element Method (TVFEM) in conjunction with the Asymptotic Waveform Evaluation (AWE) technique. The former is used to obtain modes characteristics for a central frequency, whereas the latter employs an eecient algorithm to compute frequency mo...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2009